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Abstract 

The Automatic-Flow (AFLOW) standard for the high-throughput construction of 
materials science electronic structure databases is described. Electronic structure 
calculations of solid state materials depend on a large number of parameters which 
must be understood by researchers, and must be reported by originators to ensure 
reproducibility and enable collaborative database expansion. We therefore describe 
standard parameter values for ^-point grid density, basis set plane wave kinetic en¬ 
ergy cut-off, exchange-correlation functionals, pseudopotentials, DFT-i-U parame¬ 
ters, and convergence criteria used in AFLOW calculations. 
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1. Introduction 


The emergence of computational materials science over the last two decades 
has been inextricably linked to the development of complex quantum-mechanical 
codes that enable accurate evaluation of the electronic and thermodynamic proper¬ 
ties of a wide range of materials. The continued advancement of this field entails 
the construction of large open databases of materials properties that can be eas¬ 
ily reproduced and extended. One obstacle to the reproducibility of the data is 
the unavoidable complexity of the codes used to obtain it. Published data usually 
includes basic information about the underlying calculations that allows rough re¬ 
production. However, exact duplication depends on many details, that are seldom 
reported, and is therefore difficult to achieve. 

These difficulties might limit the utility of the databases currently being created 
by high-throughput frameworks, such as AFLOW[l-3] and the Materials Project 
[4, 5]. For maximal impact, the data stored in these repositories must be generated 
and represented in a consistent and robust manner, and shared through standard¬ 
ized calculation and communication protocols. Following these guidelines would 
promote optimal use of the results generated by the entire community. 

The AFLOW (Automatic FLOW) code is a framework for high-throughput 
computational materials discovery [1-3, 6], using separate DFT packages to cal¬ 
culate electronic structure and optimize the atomic geometry. The AFLOW frame¬ 
work works with the VASP[7-10] DFT package, and integration with the Quantum 
ESPRESSO software [11] is currently in progress. The AELOW framework includes 
preprocessing functions for generating input files for fhe DFT package; obfaining 
fhe initial geometric structures by extracting the relevant data from crystallographic 
information files or by generating them using inbuilt prototype databases, and then 
transforming them into standard forms which are easiest to calculate. It then runs 
and monitors the DFT calculations automatically, detecting and responding to cal¬ 
culation failures, whether they are due to insufficient hardware resources or to run¬ 
time errors of the DFT calculation itself. Finally, AFLOW contains postprocessing 
routines to extract specific properties from the results of one or more of the DFT 
calculations, such as the band structure or thermal properties [12]. 

The AFLOWLIB repository [2, 3, 6] was built according to these principles 
of consistency and reproducibility, and the data it contains can be easily accessed 
through a representational state transfer application programming interface (REST- 
API) [3]. In this paper we present a detailed description of the AFLOW standard 
for high-throughput (HT) materials science calculations by which the data in this 
repository was created. 
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2. AFLOW Calculation Types 


The AFLOWLIB consortium[2] repository is divided into databases containing 
calculated properties of over 625,000 materials: the Binary Alloy Project, the Elec¬ 
tronic Structure database, the Heusler database, and the Elements database. These 
are freely accessible online via the AFLOWLIB website [6], as well as through the 
API[3]. The Electronic Structure database consists of entries found in the Inorganic 
Crystal Structures Database, ICSD[13, 14], and will thus be referred to as “ICSD” 
throughout this publication. The Heusler database consists of ternary compounds, 
primarily based on the Heusler structure but with other structure types now being 
added. 

The high-throughput construction of these materials databases relies on a pre¬ 
defined set of standard calculation types. These are designed to accommodate 
the interest in various properties of a given material (e.g. the ground state ionic 
configuration, thermodynamic quantities, electronic and magnetic properties), the 
program flow of fhe HT framework fhaf envelopes fhe DET porfions of fhe cal- 
culafions, as well as fhe practical need for computational robustness. The AFLOW 
standard thus deals with the pai'ameters involved in the following calculation types: 

i. RELAX. Geometry optimizations using algorithms implemented within the DET 
package. This calculation type is concerned with obtaining the ionic config¬ 
uration and cell shape and volume that correspond to a minimum in the total 
energy. It consists of two sequential relaxation steps. The starting point for 
the first step, RELAX 1, can be an entry taken from an external source, such as a 
library of alloy prototypes[15, 16], the ICSD database, or the Pauling Pile[17]. 
These initial entries are preprocessed by AFLOW, and cast into a unit cell that 
is most convenient for calculation, usually the standard primitive cell, in the 
format appropriate for the DET package in use. The second step, RELAX2, uses 
the final ionic posifions from fhe first step as its starting point, and serves as a 
type of annealing step. This is used for jumping out of possible local minima 
resulting from wavefunction artifacts. 

ii. STATIC. A single-point energy calculation. The starting point is the set of 
final ionic posifions, as produced by fhe RELAX2 sfep. The oufcome of fhis 
calculation is used in fhe determination of mosf of fhe fhermodynamic and 
elecfronic properties included in fhe various AFLOW dafabases. If Iherefore 
applies a more demanding sef of paramefers than those used on the RELAX set 
of runs. 

hi. BANDS. Electronic band structure generation. The converged STATIC charge 
density and ionic positions are used as the starting points, and the wavefunc- 
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tions are reoptimized along standardized high symmetry lines connecting spe¬ 
cial A:-points in the irreducible Brillouin zone (IBZ)[18]. 

These calculation types are performed in the order shown above (i.e. RELAXl 
^ RELAX2 ^ STATIC ^ BANDS) on all materials found in the Elements, ICSD, 
and Heusler databases. Those found in the Binary Alloy database contain data 
produced only by the two RELAX calculations. Sets of these calculation types can be 
combined to describe more complex phenomena than can be obtained from a single 
calculation. For example, sets of RELAX and STATIC calculations for different cell 
volumes and/or atomic configurations are used to calculate thermal and mechanical 
properties by the AGL [12] and APL [1] methods inplemented within the AFLOW 
framework. In the following, we describe the parameter sets used to address the 
particular challenges of the calculations included in each AFLOW repository. 

3. The AFLOW standard parameter set 

The standard parameters described in this work are classified according to the 
wide variety of tasks that a typical solid state DFT calculation involves: Brillouin 
zone sampling, Fourier transform meshes, basis sets, potentials, self-interaction 
error (SIE) corrections, electron spin, algorithms guiding SCF convergence and 
ionic relaxation, and output options. 

Due to the intrinsic complexity of the DFT codes it is impractical to specify 
the full set of DFT calculation parameters within an HT framework. Therefore, 
the AFLOW standard adopts many, but not all, of the internal defaults set by the 
DFT software package. This is most notable in the description of the Fourier trans¬ 
form meshes, which rely on a discretization scheme that depends on the applied 
basis and crystal geometry for its specification. Those internal default settings are 
cast aside when error corrections of failed DFT runs, an integral part of AFLOW’s 
functionality, take place. The settings described in this work are nevertheless pre¬ 
scribed as fully as is practicable, in the interest of providing as much information 
as possible to anyone interested in reproducing or building on our results. 

3.1. k-point sampling 

Two approaches are used when sampling the IBZ: the first consists of uni¬ 
formly distributing a large number of k-points in the IBZ, while the second relies 
on the construction of paths connecting high symmetry (special) k-points in the 
IBZ. Within AFLOW, the second sampling method corresponds to the BANDS cal¬ 
culation type, whereas the other calculation types (non-BANDS) are performed using 
the first sampling method. 
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Sampling in non-BANDS calculations is obtained by defining and setting Nkppra, 
the number of A:-points per atom. This quantity determines the total number of k- 
points in the IBZ, taking into account the k-points density along each reciprocal 
lattice vector as well as the number of atoms in the simulation eell, via the relation: 


Nkppra < min 


n«, 


XA^a 


( 1 ) 


Na is the number of atoms in the cell, and the Ni factors correspond to the number 
of sampling points along each reciprocal lattice vector, A,-, respectively. These 
factors define fhe grid resolufion, 6ki\\bi\\INi, which is made as uniform as possible 
under fhe consfrainf of Eqn. 1. The k-poinf meshes are fhen generafed wifhin fhe 
Monkhorsf-Pack scheme[19], unless fhe maferial belongs fo fhe hP, or hR Bravais 
laffices, in which case fhe hexagonal symmefry is preserved by centering the mesh 
at the T-point. 

Default Nkppra values depend on the calculation type and the database. The 
Nkppra values used for the entries in the Elements database are material specific 
and sef manually due fo convergence of fhe fofal energy calculafion. The defaulfs 
applied fo fhe RELAX and STATIC calculations are summarized in Table 1. These 
defaulfs ensure proper convergence of fhe calculafions. They may be foo sfrin- 
genf for some cases buf enable reliable application wifhin fhe HT framework, fhus 
presenting a pracficable balance befween accuracy and calculafion cosf. 


Database 

STATIC 

RELAX 

Binary Alloy 

N.A. 

6000 

Heusler 

10000 

6000 

ICSD 

10000 

8000 


Table 1: Defaull Nkppra values used in non-BANDS calculations. 

Eor BANDS calculations AFLOW generates Brillouin zone integration paths in 
the manner described in a previous publication[18]. The k-point sampling density 
is the line density of k-points along each of the straight-line segments of the path 
in the IBZ. The default setting of AFLOW is 128 k-points along each segment 
connecting high-symmetry k-points in the IBZ for single element structures, and 
20 k-points for compounds. 

The oecupancies at the Fermi edge in all non-RELAX type runs are handled via 
the tetrahedron method with Blochl corrections [20]. This involves the Nkppra 
parameter, as described above. In RELAX type calculations, where the determina¬ 
tion of accurate forces is important, some type of smearing must be performed. In 
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cases where the material is assumed to be a metal, the Methfessel-Paxton approach 
[21] is adopted, with a smearing width of 0.10 eV. Gaussian smearing is used in all 
other types of materials, with a smearing width of 0.05 eV. 

3.2. Potentials and basis set 

The interactions involving the valence electron shells are handled with the po¬ 
tentials provided with the DFT software package. In VASP, these include ultra-soft 
pseudopotentials (USPP)[22, 23] and projector-augmented wavefunction (PAW) 
potentials[24, 25], which are constructed according to the Local Density Approxi¬ 
mation (LDA)[26,27], and the Generalized Gradient Approximation (GGA) PW91[28, 
29] and PBE[30, 31] exchange-correlation (XC) functionals. The ICSD, Binary 
Alloy and Heusler databases built according to the AFLOW standard use the PBE 
functional combined with the PAW potential as the default. The PBE functional 
is among the best studied GGA functionals used in crystalline systems, while the 
PAW potentials are preferred due to their advantages over the USPP methodology. 
Nevertheless, defaults have been defined for a number of potential / XC functional 
combinations, and in the case of the Elements database, results are available for 
EDA, GGA-PW91 and GGA-PBE functionals with both USPP and PAW poten¬ 
tials. Additionally, there are a small number of entries in the ICSD and Binary 
Alloy databases (less than 1% of the total) which have been calculated with the 
GGA-PW91 functional using either the USPP or PAW potential. The exact com¬ 
bination of exchange-correlation functional and potential used for a specific entry 
in the AFLOWLIB database can always be determined by querying the keyword 
dft_type using the AFLOWLIB REST-API [3]. 

DPT packages often provide more than one potential of each type per element. 

The AFLOW standardized lists of PAW and USPP potentials are presented in Ta¬ 
bles 2 and 3, respectively. The “Label” column in these tables corresponds to the 
naming convention adopted by VASP. The checksum of each file lisfed in fhe fables 
is included in fhe accompanying supplemenf for verification purposes. 

Each potential provided wifh fhe VASP package has fwo recommended plane- 
wave kinefic energy cul-off {Ecut) values, fhe smaller of which ensures fhe relia- 
bilify of a calculafion fo wifhin a well-defined error. Additionally, maferials wifh 
more fhan one elemenf fype will have fwo or more sefs of recommended Ecut val¬ 
ues. In fhe AFLOW sfandard, fhe applied Ecut value is fhe largesl found among fhe 
recommendations for all species involved in fhe calculafion, increased by a factor 
of 1.4. 

If is possible to evaluate fhe fhe non-local parfs of fhe potentials in real space, 
rafher fhan in fhe more compufafionally intensive reciprocal space. This approach 
is prone fo aliasing errors, and requires fhe optimization of real-space projecfors if 
fhese are fo be avoided. The real-space projection scheme is mosf appropriafe for 
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larger systems, e.g. surfaces, and is therefore not used in the construction of the 
databases found in the AFLOWLIB repository. 

3.3. Fourier transform meshes 

As mentioned previously, it is not practical to describe the precise default set¬ 
tings that are applied by the AFLOW standard in the specification of the Fourier 
transform meshes. We shall just note that they are defined in terms of the grid 
spacing along each of the reciprocal lattice vectors, o,. These are obtained from 
the set of real space lattice vectors, a,-, via [bib 2 b^]^ = 2n[did2d'^]~^. A distance in 
reciprocal space is then defined by di = WbiW/n,, where fhe sef of n, are fhe number 
of grid poinfs along each reciprocal laffice vecfor, and where fhe fofal number of 
poinfs in fhe simulation is ?ii x ^2 x ^ 3 - 

The VASP package relies primarily on fhe so-called dual grid technique, which 
consisfs of fwo overlapping meshes wifh differenf coarseness. The leasf dense 
of fhe fwo is direcfly dependenf on fhe applied plane-wave basis, Ecut, while fhe 
second is a finer mesh onfo which fhe charge densify is mapped. The AFLOW 
sfandard relies on placing sufficienf poinfs in fhe finer mesh such fhaf wrap-around 
("aliasing") errors are avoided. In ferms of fhe quantify di, defined above, fhe finer 
grid is characferized by di w 0.10 while fhe coarse grid results in di w 0.15 
A“'. These two values are approximate, as there is significant dispersion in these 
quantities across the various databases. 

3.4. DFT+U corrections 

Extended systems containing d and / block elements are often poorly repre¬ 
sented within DFT due to the well known self interaction error (SIE)[27]. The 
influence fhaf fhe SIE has on fhe energy gap of insulators has long been recog¬ 
nized, and several mefhods fhaf accounf for if are available. These include fhe GW 
approximafion[32], fhe rofafionally invarianf approach infroduced by Dudarev[33] 
and Eiechfensfein[34] (denofed here as DFT-i-U), as well as fhe recenfly developed 
ACBNO pseudo-hybrid densify funcfional[35]. 

The DFT-i-U approach is currenfly fhe best suited for high-thr'oughput inves¬ 
tigations, and is therefore included in the AFLOW standard for the entire ICSD 
database, and is also used for certain entries in the Heusler database containing the 
elements O, S, Se, and E. It is not used for the Binary Alloy database. This method 
has a significant dependence on parameters, as each atom is associated with two 
numbers, the screened Coulomb parameter, U, and the Stoner exchange parameter, 
J. These are usually reported as a single factor, combined via Ues = U -J. The set 
of f/eff values associated with the d block elements[18, 36] are presented in Table 
4, to which the elements In and Sn have been added. 
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Element 

Eabel 

Element 

Eabel 

Element 

Eabel 

H 

H 

Se 

Se 

Gdt 

Gd_3 

He 

He 

Br 

Br 

Tb 

Tb_3 

Ei 

Ei_sv 

Kr 

Kr 

Dy 

Dy_3 

Be 

Be_sv 

Rb 

Rb_sv 

Ho 

Ho_3 

B 

B_h 

Sr 

Sr_sv 

Er 

Er_3 

C 

C 

Y 

Y_sv 

Tm 

Tm 

N 

N 

Zr 

Zr_sv 

Yb 

Yb 

0 

0 

Nb 

Nb_sv 

Eu 

Eu 

E 

E 

Mo 

Mo_pv 

Hf 

Hf 

Ne 

Ne 

Tc 

Tc_pv 

Ta 

Ta_pv 

Na 

Na_pv 

Ru 

Ru_pv 

W 

W_pv 

Mg 

Mg_pv 

Rh 

Rh_pv 

Re 

Re_pv 

A1 

A1 

Pd 

Pd_pv 

Os 

Os_pv 

Si 

Si 

Ag 

Ag 

Ir 

Ir 

P 

P 

Cd 

Cd 

Pt 

Pt 

S 

S 

In 

In_d 

Au 

Au 

Cl 

Cl 

Sn 

Sn 

Hg 

Hg 

Ar 

Ar 

Sb 

Sb 

Tl 

Tl_d 

K 

K_sv 

Te 

Te 

Pb 

Pb_d 

Ca 

Ca_sv 

I 

I 

Bi 

Bi_d 

Sc 

Sc_sv 

Xe 

Xe 

Po 

Po 

Ti 

Ti_sv 

Cs 

Cs_sv 

At 

At 

V 

V_sv 

Ba 

Ba_sv 

Rn 

Rn 

Cr 

Cr_pv 

Ea 

Ea 

Pr 

Pr 

Mn 

Mn_pv 

Ce 

Ce 

Ra 

Ra 

Ee 

Pe_pv 

Pr 

Pr 

Ac 

Ac 

Co 

Co 

Nd 

Nd 

Th 

Th_s 

Ni 

Ni_pv 

Pm 

Pm 

Pa 

Pa 

Cu 

Cu_pv 

Sm t 

Sm 

U 

U 

Zn 

Zn 

Sm j: 

Sm_3 

Np 

Np_s 

Ga 

Ga_h 

Eu 

Eu 

Pu 

Pu_s 

As 

As 

Gdt 

Gd 




Table 2: Projector-Augmented Wavefunction (PAW) potentials, parameterized for 
the LDA, PW91, and PBE functionals, included in the AFLOW standard.The PAW- 
PBE combination is used as the default for ICSD, Binary Alloy and Heusler 
databases. 

f: PBE potentials only, f: LDA and PW91 potentials only. 







Element 

Label 

Element 

Label 

Element 

Label 

H 

H_soft 

As 

As 

Tb 

Tb_3 

He 

He 

Se 

Se 

Dy 

Dy_3 

Li 

Li_pv 

Br 

Br 

Ho 

Ho_3 

Be 

Be 

Kr 

Kr 

Er 

Er_3 

B 

B 

Rb 

Rb_pv 

Tm 

Tm 

C 

C 

Sr 

Sr_pv 

Yb 

Yb 

N 

N 

Y 

Y_pv 

Eu 

Eu 

0 

0 

Zr 

Zr_pv 

Hf 

Hf 

F 

F 

Nb 

Nb_pv 

Ta 

Ta 

Ne 

Ne 

Mo 

Mo_pv 

W 

W 

Na 

Na_pv 

Tc 

Tc 

Re 

Re 

Mg 

Mg_pv 

Ru 

Ru 

Os 

Os 

A1 

A1 

Rh 

Rh 

Ir 

Ir 

Si 

Si 

Pd 

Pd 

Pt 

Pt 

P 

P 

Ag 

Ag 

Au 

Au 

S 

S 

Cd 

Cd 

Hg 

Hg 

Cl 

Cl 

In 

In_d 

Tl 

Tl_d 

Ar 

Ar 

Sn 

Sn 

Pb 

Pb 

K 

K_pv 

Sb 

Sb 

Bi 

Bi 

Ca 

Ca_pv 

Te 

Te 

Po 

Po 

Sc 

Sc_pv 

I 

I 

At 

At 

Ti 

Ti_pv 

Xe 

Xe 

Rn 

Rn 

V 

V_pv 

Cs 

Cs_pv 

Fr 

Fr 

Cr 

Cr 

Ba 

Ba_pv 

Ra 

Ra 

Mn 

Mn 

La 

La 

Ac 

Ac 

Fe 

Fe 

Ce 

Ce 

Th 

Th_s 

Co 

Co 

Pr 

Pr 

Pa 

Pa 

Ni 

Ni 

Nd 

Nd 

U 

U 

Cu 

Cu 

Pm 

Pm 

Np 

Np_s 

Zn 

Zn 

Sm 

Sm_3 

Pu 

Pu_s 

Ga 

Ga_d 

Eu 

Eu 



Ge 

Ge 

Gd 

Gd 




Table 3: Ultra-Soft Pseudopotentials (USPP), parameterized for the LDA and 
PW91 functionals, included in the AFLOW standard. 
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A subset of the/-block elements can be found among the systems included in 
the AFLOWLIB consortium databases. We are not aware of the existence of a sys¬ 
tematic search for the best set of U and J parameters for this region of the periodic 
table, so we have relied on an in-house pai‘ameterization[18] in the construction of 
the databases. The values used are reproduced in Table 5. Note that by construc¬ 
tion the SIE correction must be applied to a pre-selected value of the Aquantum 
number, and all elements listed in Table 4 correspond to £ - 2, while those found 
in Table 5 correspond to ^ = 3. 


Element 

f/eff 

Element 

Geff 

Sc [37] 

2.9 

W [38] 

2.2 

Ti [39] 

4.4 

Tc [38] 

2.7 

V [40] 

2.7 

Ru [38] 

3.0 

Cr [41] 

3.5 

Rh [38] 

3.3 

Mn [41] 

4.0 

Pd [38] 

3.6 

Ee [42] 

4.6 

Ag [43] 

5.8 

Co [40] 

5.0 

Cd [44] 

2.1 

Ni [40] 

5.1 

In [44] 

1.9 

Cu [41] 

4.0 

Sn [45] 

3.5 

Zn [44] 

7.5 

Ta [38] 

2.0 

Ga [46] 

3.9 

Re [38] 

2.4 

Sn [45] 

3.5 

Os [38] 

2.6 

Nb [38] 

2.1 

Ir [38] 

2.8 

Mo [38] 

2.4 

Pt [38] 

3.0 

Ta [45] 

2.0 

Au 

4.0 


Table 4: parameters applied to d orbitals. 


Element 

U 

J 

Element 

U 

J 

Ea [47] 

8.1 

0.6 

Dy [48] 

5.6 

0.0 

Ce [49] 

7.0 

0.7 

Tm [50] 

7.0 

1.0 

Pr[51] 

6.5 

1.0 

Yb [52] 

7.0 

0.67 

Nd [53] 

7.2 

1.0 

Eu [47] 

4.8 

0.95 

Sm [53] 

lA 

1.0 

Th [54] 

5.0 

0.0 

Eu [53] 

6.4 

1.0 

U [55] 

4.0 

0.0 

Gd [56] 

6.7 

0.1 





Table 5: U and / parameters applied to selected/-block elements. 
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3.5. Spin polarization 

The first of the two RELAX calculations is always performed in a collinear spin- 
polarized fashion. The initial magnetic moments in this step are set to the number 
of atoms in the system, e.g. 1.0 pB I atom. If the magnetization resulting from 
the RELAXl step is found to be below 0.025 /rB/atom, AFLOW economizes com¬ 
putational resources by turning spin polarization off in all ensuing calculations. 
Spin-orbit coupling is not used in the current AFLOW standard, since it is still too 
expensive to include in a HT framework. 

3.6. Calculation methods and Convergence criteria 

Two nested loops are involved in the DFT calculations used by AFLOW in 
the construction of the databases. The inner loop contains routines that iteratively 
optimize the electronic degrees of freedom (EDOF), and features a number of algo¬ 
rithms that are concerned with diagonalizing the Kohn-Sham (KS) Hamiltonian at 
each iteration. The outer loop performs adjustments to the system geometry (ionic 
degrees of freedom, IDOF) until the forces acting on the system are minimized. 

The convergence condition for each loop has been defined in terms of an energy 
difference, dE. If successive energies resulting from the completion of a loop are 
denoted as Ei-i and E,, then convergence is met when the condition dE ^ Ei-Ei-i 
is fulfilled. Nofe that E, can either be the electronic energy resulting from the inner 
loop, or the configurational energy resulting from the outer loop. The electronic 
convergence criteria will be denoted as dEgi^c, and the ionic criteria as dEion- The 
AFLOW standard relies on dE^iec - 10“^ eV and dEion - 10~^ eV for entries in 
the Elements database. All other databases include calculations performed with 
dEeiec = 10~^ eV and dEion = 10~^ eV. 

Optimizations of the EDGE depend on sets of parameters that fall under three 
general themes: initial guesses, diagonalization methods, and charge mixing. The 
outer loop (optimizations of the IDOE) is concerned with the lattice vectors and 
the ionic positions, and is not as dependent on user input as the inner loops. These 
are described in the following paragraphs. 

3.6. L Electronic degrees of freedom 

The first step in the process of optimizing the EDGE consists of choosing a 
trial charge density and a trial wavefunction. In the case of the non-BANDS-type 
calculations, the trial wavefunctions are initialized using random numbers, while 
the trial charge density is obtained from the superposition of atomic charge den¬ 
sities. The BANDS calculations are not self-consistent, and thus do not feature a 
charge density optimization. In these cases the charge density obtained from the 
previously performed STATIC calculation is used in the generation of the starting 
wavefunctions. 
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Two iterative methods are used for diagonalizing the KS Hamiltonian: the 
Davidson blocked scheme (DBS)[57, 58], and the preconditioned residual mini¬ 
mization method - direct inversion in the iterative subspace (RMM-DIIS)[10]. Of 
the two, DBS is known to be the slower and more stable option. Additionally, 
the subspace rotation matrix is always optimized. These methods are applied in a 
manner that is dependent on the calculation type: 

i. RELAX calculations. Geometry optimizations contain at least one determina¬ 
tion of the system forces. The initial determination consists of 5 initial DBS 
steps, followed by as many RMM-DIIS steps as needed to fulfill the SEgiec 
condition. Later determinations of system forces are performed by a similar 
sequence, but only a single DBS step is applied at the outset of the process. 
Across all databases the minimum of number of electronic iterations for RELAX 
calculations is 2. The maximum number is set to 120 for entries in the ICSD, 
and 60 for all others. 

ii. non-RELAX calculations. In STATIC or BANDS calculations, the diagonaliza- 
tions are always performed using RMM-DIIS. The minimum number of elec¬ 
tronic iterations performed during non-RELAX calculations is 2, and the maxi¬ 
mum is 120. 

If the number of iterations in the inner loop somehow exceed the limits listed 
above, the calculation breaks out of this loop, and the system forces and energy are 
determined. If the d£';o„ convergence condition is not met the calculation re-enters 
the inner loop, and proceeds normally. 

Charge mixing is performed via Pulay’s method[59]. The implementation of 
this charge mixing approach in the VASP package depends on a series of parame¬ 
ters, of which all but the maximum /’-quantum number handled by the mixer have 
been left in their default state. This parameter is modified only in sysfems included 
in fhe ICSD dafabase which confain fhe elemenfs lisfed in Tables 4 and 5. In prac- 
fical ferms, fhe value applied in fhese cases is fhe maximum /-quanfum number 
found in fhe PAW pofenfial, mulfiplied by 2. 

3.6.2. Ionic degrees of freedom and lattice vectors 

The RELAX calculation fype confains deferminafions of fhe forces acting on 
fhe ions, as well as fhe full system stress tensor. The applied algorithm is the 
conjugate gradients (CG) approach[60], which depends on these quantities for the 
full optimization of the system geometry, i.e. the ionic positions, the lattice vectors, 
as well as modifications of the cell volume. The implementation of CG in VASP 
requires minimal user input, where the only independent parameter is the initial 
scaling factor which is always left at its default value. Convergence of the IDOF, 
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as stated above, depends on the value for the SEion parameter, as applied across the 
various databases. The adopted Ecut (see discussion on “Potentials and basis set”, 
section 3.2) makes corrections for Pulay stresses unnecessary. 

Forces acting on the ions and stress tensor are subjected to Harris-Foulkes[61] 
corrections. Molecular dynamics based relaxations are not performed in the con¬ 
struction of the databases found in the AFLOWLIB repository, so any related set¬ 
tings are not applicable to this work. 

3. 7. Output options 

The reproduction of the results presented on the AFLOWLIB website also de¬ 
pends on a select few parameters that govern the output of the DFT package. The 
density of states plots are generated from the STATIC calculation. States are plotted 
with a range of -30 eV to 45 eV, and with a resolution of 5000 points. The band 
structures are plotted according to the paths of k-points generated for a BANDS 
calculation[18]. All bands found between -10 eV and 10 eV are included in the 
plots. 

4. Conclusion 

The AFLOW standard described here has been applied in the automated cre¬ 
ation of the AFLOWLIB database of material properties in a consistent and re¬ 
producible manner. The use of standardized parameter sets facilitates the direct 
comparison of properties between different materials, so that specific trends can 
be identified to assist in the formulation of design rules for accelerated materials 
development. Following this AFLOW standard should allow materials science re¬ 
searchers to reproduce the results reported by the AFLOWLIB consortium, as well 
as to extend on the database and make meaningful comparisons with their own 
results. 
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